Research that powers transparent, reliable, and effective AI models

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Al is rapidly being adopted by people in nearly every industry.

Frontier labs are fine-tuning their models for higher quality outputs, but the industry still does not have a deep understanding of why Al models say what they say.

This lab is focused on scaling the interpretability research necessary to make better AI systems possible.